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EFFECT OF SCALE ON LONG-RANGE RANDOM GRAPHS AND 
CHROMOSOMAL INVERSIONS 

By Nathanael Berestycki and Richard Pymar 

University of Cambridge 

We consider bond percolation on n vertices on a circle where 
edges are permitted between vertices whose spacing is at most some 
number L = L{n). We show that the resulting random graph gets 
a giant component when L S> (logn)^ (when the mean degree ex- 
ceeds 1) but not when L <^ logn. The proof uses comparisons to 
branching random walks. We also consider a related process of ran- 
dom transpositions of n particles on a circle, where transpositions 
only occur again if the spacing is at most L. Then the process ex- 
hibits the mean-field behavior described by Berestycki and Durrett if 
and only if L{n) tends to infinity, no matter how slowly. Thus there 
are regimes where the random graph has no giant component but the 
random walk nevertheless has a phase transition. We discuss possible 
relevance of these results for a dataset coming from D. repleta and D. 
melanogaster and for the typical length of chromosomal inversions. 

1. Introduction and results. 

1.1. Random graphs results. Let n > 1 and let L = L{n) > 1. Define ver- 
tex set V = {1, . . . ,n} and edge set TZl = {{hj) £ IN ~ ill ^ -^1) where 
\\i — j\\ denotes the cyclical distance between i and j, that is, \\u\\ = min(|u|, 
n — \u\) for G y. In this paper we consider bond percolation on V where 
each edge in TZl is open with probability p. Equivalently, let {G{t),t > 0) be 
the random graph process where a uniformly chosen edge oiTZi is opened in 
continuous time, at rate 1. Let A^(t) > A^(t) > • • • denote the ordered com- 
ponent sizes of G{t). At a fixed time t this corresponds to the above model 
with p = \ — exp{— t/(nL)}. When L = 1, this is the usual bond percolation 
model on the cycle of length n, while for L{n) = n/2, we find that G{t) is 
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a realization of the much studied random graph model of Erdos and Renyi 
[see Bollobas (1985) and Durrett (2010) for background]. Hence, our random 
graph model interpolates between these two cases. 

In this paper we are interested in the properties of the connected compo- 
nents of G{t), particularly those related to the possible emergence of a giant 
component when the average degree exceeds 1. The main result of this paper 
shows that this depends on the scale L{n). To state our results, we let c > 
and consider t = cn/2, so that the expected degree of a given vertex in G{t) 
converges to c when n — )• oo. Let A^(t) > A^(t) > ■•• denote the ordered 
component sizes of G{t). 

Theorem 1. Let t = cn/2, where c> is fixed as n — >• oo. 

(i) If c < 1, then there exists C < oo depending only on c such that 
A^(t) < Clogn with high probability as n — )• oo. 

(ii) If c> 1 and there exists ^ > such that L[n) > (logn)^+^, then there 
is a unique giant component; more precisely, 



in probability as oo, where 0{c) is the survival probability of a Poisson(c) 
Galton- Watson tree. Moreover, A^(t)/n— t-O in probability. 
(iii) However, if c> 1 and L = o(logn), then for all a> 0, 



in probability as oo. In particular there are no giant components. 

Statement (i) is fairly easy to prove using the standard technique of ap- 
proximating the size of a component in the graph by the total progeny of 
a branching process. The result follows since in the case c < 1 we know that 
the total progeny of the branching process is almost surely finite and has 
exponential tails. 

Part (ii) is the most challenging. We start by noting that the exploration 
of the component containing a given vertex v may be well-approximated 
by the trace of a branching random walk where the step distribution is 
uniform on {—L,...,L}. This approximation is valid so long as the local 
density of the part of the component already explored stays small. Thus, 
showing the existence of a giant component requires a balancing act; we 
need to ensure that the local density of what we explore stays small enough 
to ignore self-intersections, but large enough for global connections to occur. 
Careful estimates on survival probabilities of killed branching random walks 
are used to achieve this. 

Part (iii) is the easiest to prove, and requires showing the existence of 
many "blocking" intervals of size L which consist just of vertices with de- 
gree 0. When there are many such intervals, no giant component can exist. 



(1) 




n 



(2) 
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1.2. Long-range random transpositions. Theorem 1 was originally mo- 
tivated by the study of a question concerning long-range transpositions, 
which may itself be rephrased as a question in computational biology. We 
now discuss the question on long-range random transpositions and delay the 
applications to comparative genomics until Section 2. 

Recall the definitions of V and TZl in Section 1.1. Consider a random 
process (cTj,t > 0) with values in the symmetric group Sn, which evolves as 
follows. Initially, ctq = e is the identity permutation. Let (ii, Ji), (^2) ^2), • • • 
be an i.i.d. infinite sequence of pairs of elements of V, where each pair is 
uniformly distributed on TZl- Then we put 

(3) fjf =rAr, o • • • on, 

where for each A; > 1 we let Tk denote the transposition {ik,jk), {^tit > 0) 
is an independent Poisson process with rate 1 and o the composition of two 
permutations. That is, informally, if we view the permutation at as describ- 
ing the positions on the circle of n particles labeled by V [with crt{i) denoting 
the position of particle i £V], then in continuous time at rate 1, a pair of 
positions is sampled uniformly at random from TZl and the two par- 
ticles at positions i and j are swapped. Thus the case where L{n) > n/2 
corresponds to the well-known random transposition process (i.e., the com- 
position of uniform random transpositions), whereas the case where L{n) = 1 
corresponds to the case of random adjacent transpositions on the circle. 

Our interest consists of describing the time-evolution of 5{at), where for 
all a £ Sn we set 6{a) = n — \a\ and \a\ to be the number of cycles of a. By 
a well-known result of Cayley, this is the length of a minimal decomposition 
of a into a product of any transpositions (i.e., whose range is not neces- 
sarily restricted to TZl)- The reason for this choice will become apparent 
in subsequent sections and is motivated by the applications to comparative 
genomics. 

For c > 0, define a function 



It is known that u{c) = c/2 for c < 1 but u{c) < c/2 for c > 1 [see, e.g., 
Bollobas (1985), Theorem 5.12]. The function u is continuously differentiable 
but has no second derivative at c= 1. We shall prove the following results. 

Theorem 2. Assume L{n) — )• 00 as n —t- 00. Then we have the following 
convergence in probability as n — )• 00; for all c> 0, 



00 



(4) 




k=l 



(5) 
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In this result the distance between the two points being transposed at 
every transposition is uniform within {1, . . . , L(n)}. We wiU prove in The- 
orem 6 given in Section 5 a more general version of this result, where this 
length is allowed to be some arbitrary distribution subject to the condition 
that there are no "atoms in the limit," which is the equivalent of requiring 
here L{n) — t- oo. 

By contrast, the microscopic regime (where L is assumed to be constant 
or to have a limit) shows a remarkably different behavior. 

Theorem 3. Assume limn^ooL{n) exists. Then we have convergence 
in probability; for all c> 0, 

-Sicrcn/2) ^v{c) 
n ' 

as CO, for some function v{c) which satisfies < v{c) < c/2 for all 
OO. 

As we will describe in greater detail later on, there is a connection be- 
tween the long-range random transposition process and the random graph 
process of Theorem 1. Roughly speaking, when L{n) is bounded, we expect 
v{c) < c/2 because each new edge has a positive probability of having its two 
endpoints in the same connected component. Alternatively, the branching 
random walk which is used to explore the connected component of a vertex 
has a positive probability of making a self-intersection at every new step. 

The mean- field case where L{n) = n/2 recovers Theorem 4 of Berestycki 
and Durrett (2006). Theorem 2 above relies on a coupling with the random 
graph G{t) of Theorem 1; this coupling is similar to the coupling with the 
Erdos-Renyi random graph introduced in Berestycki and Durrett (2006). 
In that paper, the emergence of the giant component in the Erdos-Renyi 
random graph was a crucial aspect of the proofs. As a result, one might 
suspect that the phase transition of S{at) is a direct consequence of the 
emergence of a giant component in the random graph. However, one par- 
ticularly surprising feature of Theorem 2 above is the fact that the limiting 
behavior described by (5) holds for all L{n) — )• 00, no matter how slowly. 
This includes in particular the cases where L(n) = o(logn) and the random 
graph G{t) does not have a giant component. Hence, for choices of L{n) 
such that L{n) — )• 00 but L{n) = o(n), the quantity 6{at) has a phase tran- 
sition at time n/2, even though the random graph G{t) does not get a giant 
component at this time. 

1.3. Relation to other work, and open problems. Long-range percolation. 
A similar model has been studied by Penrose (1993). There the model consid- 
ered is on the infinite square grid Z"^, rather than the finite (one-dimensional) 
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torus which we consider here. In the infinite case, d = 1 is trivial since per- 
colation (occurrence of an infinite cluster) only occurs if p = 1 for obvious 
reasons. Penrose studied the case d>2 and showed that if c is the expected 
degree of the origin, and L the maximum distance between the two ends 
of a bond, where the parameter L — t- oo and c is fixed, then the percola- 
tion probability approaches 6{c) [where 6{c) is the same as in (1), i.e., the 
survival probability for a Galton-Watson process with Poisson(c) offspring 
distribution]. As is the case here, his arguments use a natural comparison 
with branching random walks. 

It is interesting that, while the infinite case is essentially trivial when 
d = l, the finite-n case is considerably more intricate than the infinite case, 
as witnessed by the different behaviors in (1) and (2) depending on how 
fast L{n) —7- oo. Regarding the finite-n situation, it is an interesting open 
question to see whether there are giant components if t = cn/2 and c > 1 
with logn < L(n) < (logn)^. Another interesting problem concerns the size 
of the largest components when there is no giant component, in particular, 
if L = o(logn). Indeed, our proof makes it clear that when L{n) — )• oo, even 
if the largest component is not macroscopic, there is a positive proportion 
of vertices in components of mesoscopic size. We anticipate that as c> 1 is 
fixed and L increases, the size of the largest component, normalized by n, 
jumps from to 6{c) as L(n) passes through a critical threshold between 
logn and (logn)^. As pointed out by a referee, this is suggested by a work of 
Aizenman and Newman (1986) on long-range bond percolation on Z where 
the connection probability between vertices at distance x > decays like 
Their main result (Proposition 1.1) shows that such discontinuities 
occur in this case. 

Epidemic models. The question of giant components in random graph 
models can, as usual, be rephrased in terms of epidemic processes. More 
precisely, fix a vertex v and a number p S (0, 1). Consider an SIR epidemic 
model that begins with all vertices susceptible but vertex v infected. Once 
a vertex is infected, it transmits the infection to each of its neighbors in the 
base graph (y,E) at rate A > and dies or is removed at rate 1. Then the 
total size of the epidemic is equal to the size of the component containing v 
in the random graph with edge-probability p = A/(l -|- A). As pointed out 
by an anonymous referee, Bramson, Durrett and Swindle (1989) consider 
the related SIS model (or contact process) on Z*^ where, as here, long-range 
connections are possible. Similar techniques are employed as in this article 
to calculate the critical rate of infection and the probability of percolation. 
Letting infections occur at rate \/ \o\B{L) where B{L) is a ball or radius L 
in Z"^, they show that the critical infection rate Ac converges to 1 in all 
dimensions as L — )■ oo. They also identify the rate of convergence, which 
turns out to depend on the dimension in an interesting way. 
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Higher- dimensional analogs of Theorem 1. Our proofs do not cover the 
higher-dimensional cases but it would not be very difficult to adapt them. 
In particular, the analogue of (1) would hold if d > 2 no matter how slowly 
L{n) — 7- cx). In other words, only for the one-dimensional case is it important 
to have some quantitative estimates on L{n). Intuitively this is because, in 
one dimension, one is forced to go through potentially bad regions whereas 
this problem does not arise in higher dimensions. 

Regarding site percolation, we point out that recently Bollobas, Janson 
and Riordan (2009) have described an interesting behavior for a site per- 
colation model on the torus in dimensions d >2 where two vertices are 
joined if they agree in one coordinate and differ by at most L in the other. 
For d = 2 they show that the critical percolation probability, Pc{L), satis- 
fies \\m.L^aoLpc{L) =log(3/2). This is surprising as the expected degree of 
a given vertex at the phase transition is then strictly greater than 1. There 
again, approximation by branching random walks plays an important role 
in the proof. 

Slowdown transitions for random walks. In the mean- field case L{n) = 
n/2 of uniformly chosen random transpositions, the quantity 5(0"^) may be 
interpreted as the graph-theoretic distance between the starting position of 
the random walk (cjo = the identity element) and the current position of the 
walk. Theorem 2 in this case [which, as already mentioned, is Theorem 4 
of Berestycki and Durrett (2006)], may thus be interpreted as a slowdown 
transition of the evolution of the random walk; at time n/2, the acceleration 
[second derivative of S{at)] drops from to — oo. By contrast, Berestycki 
and Durrett (2008) studied the evolution of the graph-theoretic distance 
in the case of random adjacent transpositions. This essentially corresponds 
to the case L = 1, with the difference that the transposition (1 n) is not 
allowed. They found that no sudden transition occurs in the deceleration of 
the random walk. It would be extremely interesting to study the evolution 
of the graph-theoretic distance of the random walk when L = L{n) is a given 
function that may or may not tend to infinity as n — t- oo. Unfortunately, this 
problem seems untractable at the moment as it is far from obvious how to 
compute (or estimate) the graph distance between two given permutations. 
[We note that even in the case L = 1 where the transposition (1 n) is allowed, 
this question is partly open; see Conjecture 3 in Berestycki and Durrett 
(2008).] Nevertheless it is tempting to take Theorems 2 and 3 as an indication 
that a slowdown transition for the random walk occurs if and only if L(n) — )• 
oo, with the phase transition always occurring at time n/2. 

Organization of the paper. In Section 2 we show how Theorems 2 and 3 
relate to a biological problem and in particular discuss the possible rele- 
vance of these results for a dataset coming from two Drosophila species. 
In Section 3 we state and prove results on the evolution of the clusters in 
a random graph which evolves in a more general way to G{t). In Section 4 
we give a proof of Theorem 1 . Section 5 contains a proof of a result stronger 
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Table 1 

Order of the genes m D. repleta compared to their order in D. melanogaster 
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than Theorem 2 using the more general random graph process defined in 
Section 3. Finally, in Section 6 we present the proof of Theorem 3. 

2. Applications in comparative genomics. 

2.1. Statement of problem and history. Part of the motivation for this 
paper comes from a biological background, more specifically, in answering 
a question about the evolution of the gene order of chromosomes. We begin 
with an example. In 2001 Ranz, Casals, and Ruiz located 79 genes on chro- 
mosome 2 of Drosophila repleta and on chromosome arm 3R of Drosophila 
melanogaster. While the genetic material is overall essentially identical, the 
order of the genes is quite different. If we number the genes according to 
their order in D. repleta then their order in D. melanogaster is given in 
Table 1. 

Since the divergence of the two species, this chromosome region has been 
subjected to many reversals or chromosomal inversions, which are moves 
that reverse the order of whole gene segments. Because they involve many 
base pairs at a time rather than the more common substitutions, insertions 
and deletions, these mutations are called large-scale. They are usually called 
inversions in the biology literature, but we stick with the word reversal as 
"inversions" is often used among combinatorists with a different meaning 
[see, e.g., Diaconis and Graham (1977)]. One question of interest in the field 
of computational biology is the following: How many such reversals have 
occurred? 

Hannenhalli and Pevzner (1999) have devised a widely used algorithm 
which computes the parsimony distance, the minimal number of reversals 
that are needed to transform one chromosome into the other (this will be 
denoted here by doo)- By definition, the number of reversals that did occur is 
at least doo- Berestycki and Durrett (2006) complemented this by rigorously 
analyzing the limiting behavior of the discrepancy between the true distance 
and the parsimony distance [described by the function u{c) in Theorem 2], 
under the mean-field assumption that all reversals are equally likely. 
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However, that assumption does not seem to be entirely justified and it 
might be more accurate to restrict the length of the segment being reversed. 
According to Durrett (2003), "To seek a biological explanation of the nonuni- 
formity we note that the gene-to-gene pairing of homologous chromosomes 
implies that if one chromosome of the pair contains an inversion that the 
other does not, a loop will form in the region in which the gene order is 
inverted. ... If a recombination occurs in the inverted region then the re- 
combined chromosomes will contain two copies of some regions and zero 
of others, which can have unpleasant consequences. A simple way to take 
this into account is. . . [to] restrict our attention to the L-reversal model." 
The reasoning here is that as the length of the segment reversed increases, 
the probability of recombination increases. Here, the L-reversal model is 
to allow only reversals that switch segments of, at most, length L and all 
such reversals have equal probability. A further argument can be seen in 
Durrett (2002) who argues that not all inversions occur at the same rate; 
when a large amount of DNA is absent from a chromosome, the offspring is 
typically not viable, so longer inversions will occur at a lower rate. 

2.2. Estimating the number of chromosomal inversions. To estimate the 
number of chromosomal inversions (or reversals) in the long-range spatial 
model, one natural idea is to use the parsimony approach; that is, compute 
the d^-distance (minimal number di of L-reversals needed to transform one 
genome into the other) and then prove a limit theorem for the evolution 
of diit) under random L-reversals. However, this appears completely out 
of reach at this stage; the crucial problem is that we do not know of any 
algorithm to compute the L-reversal distance. [Even in the case L = 1, if 
particles are lying on a circle, this is a delicate problem; see Conjecture 3 
in Berestycki and Durrett (2008).] Thus, even if a limit theorem could be 
proved, we would not know how to apply it to two given genomes. 

In order to tackle this difficulty, we propose here the following alternative 
approach. We keep looking at the doo-distance (minimal number of reversals 
needed to transform one chromosome into the other, no matter their length) 
but now we think of d^o only as an easily computed statistic on which we can 
make some inference, even though not all reversals were equally likely. More 
precisely, we are able to describe the evolution of the quantity doo{t) under 
the application of random L-reversals, and use that result to estimate t from 
the data doo{t). 

We first state the result in this context, and illustrate our idea with a nu- 
merical example in Section 2.3. The distance doo is defined in terms of an 
object known as the breakpoint graph. For definitions of these notions we 
refer the interested reader to Chapter 9 of Durrett (2002). For signed per- 
mutations o", 0"' we let 6{a, a') = n + 1 — c, where c is the number of compo- 
nents of the breakpoint graph. In general [see Durrett (2002), Theorem 9.1], 
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(ioo(f, o"') > 6{a,a'). The quantity 5 {a, a') ignores obstacles known as "hur- 
dles" and "fortresses of hurdles." All these are thought to be negligible in 
biologically relevant cases, so we will use 6{a,a') as a proxy for doo{cr,a'). 
Let (Tf be the signed permutation obtained by composing Poisson(t) inde- 
pendent L-reversals. We slightly abuse notation and write 5{at) for 5{ao,at). 

Theorem 4. Assume that L(n)— ^co. Then 

n ' 

However, when L{n) stays bounded, we get a behavior similar to Theo- 
rem 3. 

Theorem 5. Assume lim„„j.oo-^('^) exists. Then we have convergence 
in probability; for all c> 0, 

-H(^cn/2) -^W{c) 

as n ^ oo, for some function w{c) which satisfies < w{c) < c/2 for all 
OO. 

The proofs for these two results are verbatim identical to those of Theo- 
rems 2 and 3. The choice of stating our results for transpositions is merely 
one of convenience, as transpositions are easier to describe and more familiar 
to many mathematicians. 

2.3. Numerical application to Drosiphila set. We now illustrate on the 
dataset from Table 1 the possible relevance of Theorems 4 and 5. We first 
compute the parsimony distance in this case. Here there are n = 79 genes, 
and even though the orientation of each gene is not written, it is not dif- 
ficult to find an assignment of orientations which minimizes the parsimony 
distance doo{(j). We find that the parsimony distance is doo{(y) = 54. 

First assume that all reversals are equally likely, or that L is large enough 
that the behavior described in Theorem 4 holds, and let us estimate the 
actual number of reversals that were performed. We are thus looking for t 
such that dooicTt) = 54 when n = 79. Changing variables t = cn/2, we are 
looking for c > such that u{c) = 54/79 ~ 0.68. Thus, inverting u we find c ~ 
1.6 and hence, we may estimate the number of reversals to be around t = 63. 
Note that the discrepancy with parsimony (doo = 54) is already significant. 

This estimate keeps increasing as L decreases and the behavior of The- 
orem 3 starts kicking in. For instance, with L = 4 (so that L/n ~ 5%), 
simulations give c ~ 2.4 or t ~ 95 reversals, or 175% of the initial parsimony 
estimate! 
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Ideally, we would want to use estimates in the biology literature on the 
typical range of reversals, in combination with the results of this paper, to 
produce a refined estimate. Kent et al. (2003) estimated the median length 
of a reversal in human/mouse genomes to be about 1 kb, corresponding very 
roughly speaking to L being a few units, say 1 < L < 4. (However, they find 
a distribution for the reversal lengths which is bimodal and hence, quite 
different from the one we have chosen for simplicity in this paper.) Other 
estimates we found in several biology papers differed by several orders of 
magnitude, so that there does not appear to be a consensus on this ques- 
tion. Instead, we performed some reverse engineering, and compared our 
method with other existing methods. York, Durrett and Nielsen (2002) used 
a Bayesian approach in a model comparable to ours. The mode of the pos- 
terior distribution was at t ~ 87, with the parsimony estimate lying outside 
the 95% confidence interval [see Durrett (2002), Section 9.2.2, for further 
details]. This suggests that L is slightly more than 4, broadly speaking con- 
sistent with the estimate of Kent et al. (2003). 

2.4. Simulations for transpositions. We complement the above example 
with plots (see Figure 1) to show how 6{t) = 6{at) evolves with t for finite n 
by straightforward MCMC, averaging over 1000 simulations in each case. 
The dotted line shows n(c) and the solid line shows the average over the 
simulations. We observe that as L increases, u(c) provides a better estimate 
to the parsimony. 




(d) 



(e) 



(f) 



Fig. 1. Here (a) n = 100, L = 1; (b) n = 100, L = 5; (c) n = 100, L = 50; (d) n = 1000, 
L = 1; (e) n = 1000, L = 50; (f) n = 1000, L = 500. 
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3. Evolution of the components of the random graph. We begin by prov- 
ing a few results relating to the components of a random graph which evolves 
in a more general way than previously defined. For each n > 2, fix a prob- 
ability distribution (p")i<£<[n/2j • We will omit the superscript n in all cal- 
culations below in order to not overload the notation. For the rest of this 
section we redefine {G{t),t > 0) to be the random graph process where at 
rate 1 we choose a random variable D according to the distribution (pi), and 
open a uniformly chosen edge from those of graph distance D. We define 

(6) En ■■= max Pi. 

l<e<[n/2i 

We begin by analyzing how the components in the random graph G{t) evolve 
over time. 



Lemma 1. Let C{t) he the connected component of G{t) containing some 
fixed vertex v £V , and let t = cn/2 for some c > 0. Assume that En 0. We 
have that C{t) ^ Z (where ^ stands for stochastic domination) and 

|C(t)|— )-Z as n —)• oo 

in distribution. Here Z is the total progeny of a Galton-Watson branching 
process in which each individual has a Poisson(c) number of offspring. 

Remark 1 . The argument below is simpler to follow in the case where (pi) 
is the uniform distribution in {1, ... , L(n)}. We will need a more precise ver- 
sion of this lemma later on, in Lemma 5. It may thus be helpful for the reader 
to look at Lemma 5 first in order to understand the main idea of the proof 
of Lemma 1. 



Proof of Lemma 1. We use the breadth-first search exploration of the 
component C{t). That is, we expose the vertices that form G{t) by looking 
iteratively at neighborhoods of increasing radius about v. In doing so, the 
vertices of G{t) are naturally ordered according to levels i = 1,2, . . . which 
represent the distance of any vertex from that level to the vertex v. To be 
more precise, if AcV let M{A) denote the neighborhood of A, that is, 

M{A) = {y € y : y ~ x in G{t) for some x G A}. 

Let A^ = {v} and then define inductively for i > 0, 

i 

A'+^ =M{A')\^[jAK 

j=0 

The statement of the lemma will follow from the observation that when t = 
cn/2, the sequence (l^^l, lA-*^], . . .) converges in the sense of finite-dimensional 
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distributions toward {Z^, Z^, . . .), the successive generation sizes of a Poisson(c) 
Gallon- Watson process. Thus, fix an integer-valued sequence (no,ni,...) 
with riQ = 1. We wish to show that 

P(|^°| =no,...,\A'\= m) ^ P(ZO = no,...,Z' = m) 

as n — 7- oo, which we do by induction on i > 0. The statement is trivial 
for i = 0. Now let i > 0. Given Ai = {A^ = 71q, . . . , A* = nj}, we look at the 
neighbors in level z + 1 of each vertex vi, . . . , in level i, one at a time. 

Let G{t) be the multigraph on V with identical connections as G(t), but 
where each edge is counted with the multiplicity of the number of times 
the transposition (i, j) has occurred prior to time t. Equivalently, for each 
unordered pair of vertices G V"^ at distance ||i — j|| = ^ > 1, consider an 
independent Poisson process N^^'^\t) of parameter 2pi/n. Then the multi- 
graph G{t) contains N^'^'^\t) copies of the edge (i,j), while the graph G{t) 
contains the edge if and only if N^^'^\t) > 1. 

Note that if w £V, then the degree of w in G(t) is 

dw = Poisson{2tpi/n) =d Poisson(c). 

Let J-i = cr(A^, . . . ,A^). Conditionally on J-^, order the vertices from A^ in 
some arbitrary order, say vi, . . . ,Vni- Observe that 



^^+1 = u 



j-1 



Af{vj)\i[jA^u[jAf{v,) 

j=l L \i=0 k=l / 

It follows directly that, conditionally on J^i, 

rii 

(7) \A'+'\^J2p^^ 

i=i 

where Pj are independent Poisson(c) random variables which are further 
independent from J^j. (The stochastic domination \Cv\ ^ Z already follows 
from this observation.) For 1 < j < n^, let J^jj = J^iV a{J\f{vi) U • • •UM{vj)). 
Observe that, conditionally given then 



(8) K 



-1 



M{v,)\l[jA^u[jMiv,) 

\j=0 k=l 

is stochastically dominated by Pj but also dominates a thinning of Pj 
which is a Poisson random variable with parameter c(l — Men), where 
M = uq + ■■■ + rii + \J\f_{vi)\ + • • • + |A/'(Dj_i)|, where Af{w) denotes the 
neighborhood of w in G{t) (hence, neighbors are counted with multiplic- 
ity). Furthermore, note that the random variables {Nj, 1 < j < rii) are con- 
ditionally independent given J^j. Since E(Me„|J^j) — )• by the stochastic 
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domination (7), it follows that 

This completes the induction step and finishes the proof of convergence 
in distribution. □ 

A useful consequence of this result is the following lemma. 

Lemma 2. Let t = cn/2, where c > 0. Then as oo, the number, Kt, 
of connected components of G{t) satisfies 

oo , k—2 

(9) E(irO~nJ]^(ce-^)^ 

k=l ^ 

Proof. For u E y, let be the component containing vertex v. Then 
observe that the total number of components is given by X^„gy and thus 

by exchangeability, the expected number of components is nE(l/|C^|). Di- 
viding by n and applying the bounded convergence theorem (since 1/ 1 C„ | < 1 ) 
as well as Lemma 1, we obtain 

1 oo ^ oo , k—2 

-nKt)^Y.inz=k)=Y,^{ce-^)\ 

k=l k=l 

where the exact value P(Z = k) of the probability mass function of Z is 
the well-known Borel-Tanner distribution [see, e.g., Berestycki and Durrett 
(2006), Corollary 1]. □ 

We now prove that the number of components Kt is concentrated around 
its mean. 

Lemma 3. Let c > and let t = cn/2. Assume that En — s- 0. We have 
Kt/K{Kt) — 7- 1 m probability as n — t- oo. 

Proof. We write Kt = Yt + Wt where Yj counts the components smaller 
than a threshold T = (l/e„)^/^ and Wt those that are greater than this 
threshold. Note that Wt < n/T = o{n) and thus it suffices to show that Yt 
is concentrated around its mean, that is, Var(lt) = o(n^). 

Note that we can always write 



and thus 

Yav{Yt)=nYav{{l/\Cy\)l{\c,\<T}) + ^Cov(-^l^^c^\^T^ 
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Since l/|Ct,| < 1, the first term in the right-hand side is smaller than n. De- 
fine = j^l{|c„|<T}, = ]^1{|C„|<T}- To know the value of and S^, 
it suffices to explore by breadth-first search a relatively small number of ver- 
tices in the components of v and w. While we do so, it is unlikely that the 
exploration of these components will ever intersect, hence, the random vari- 
ables Sy and 5^ are nearly independent. 

To formalize this idea, let Cj (resp., C^) denote the subset of (resp.. Cm) 
obtained by exploring at most T individuals using breadth-first search as 
above. Let be a copy of 5„, independent from Cw Then condition- 
ally on C^, exploring until at most T vertices have been exposed using 
breadth- first search, we may take = except if Cj intersects with C^, 
an event which we denote by A. (To see this, imagine generating an inde- 
pendent copy C^, using the same number of offsprings and positions for 
each individual in the breadth-first search exploration of Cj as in, but stop 
if at any point Cj has an intersection with C^.) 

Thus, letting rriy = K{Sy) = K{Sy), since is independent from C^, and 

~ n 

since Sy = Sy on A , 

E{Sy - my\Cl) = E{{Sy " S y)\C^) + E{{S y " my)\C^) 

= EiiSy - Sy)U\Cl) + E{{Sy " Sy)l_^o\Cl) 

= E{{Sy-Sy)lA\Cl) a.s., 
and thus since < Sy < 1 and < S't, < 1 , 

|E(5,„-m,|C^)| <2P(^|C;^) a.s., 

so that 

Cov(S„,5^) <4P(^). 

Now observe that by Markov's inequality, P(^) < E(e(Cj, C^)), where 
e{A,B) denotes the number of edges between A and B. Since |Cj| <r and 

\C^\<T by definition, we have E(e(Cj, C^)) < ce„TV2 = 0{el/^) 0. The 
lemma follows. □ 

4. Proof of Theorem 1. 

4.1. Connection with branching random walk. In this section we return 
to considering the random graph model {G{t),t > 0) as given in the Introduc- 
tion. The proof of (i) in Theorem 1 is easy and follows directly from the 
observation that for a given vertex v, \Cy\ is stochastically dominated by Z, 
the total progeny of a Poisson(c) Galton- Watson tree (see Lemma 1). When 
c < 1 it is easy to see that there exists A > and C < oo such that P(Z > 
k) < Ce~'^^ . Taking k = Mogn with b sufficiently large, (i) now follows from 
a simple union bound. 
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We turn to the proof of (ii) in Theorem 1, which is the most challenging 
technically in this paper, and assume that c > 1. The key to the investi- 
gation of the properties of G{t) with t = cn/2 is the following observation, 
which connects the geometry of a given component to the range of a certain 
branching random walk. We start by introducing notation and definitions. 
Let r be a Galton-Watson tree with a given offspring distribution and de- 
note by Ti the ith. level of the tree T. Let {S{v),v € T) denote a T-indexed 
random walk. That is, let (X(e))egT be a collection of i.i.d. random vari- 
ables with a prescribed step distribution, and for all vertices v £T, define 
S{v) = S{o) + J2e^v-^'"^ where the sum e runs along all edges that are 
on the shortest path between the root o and v. 

Let t = cn/2. Let w £V, say w = 0, and let C = Cw be the component 
containing w in G{t). Consider the breadth-first exploration of C introduced 
in Lemma 1. RecaU that =M{A^) \ Uj=o^^- Observe that it could be 
that two vertices w,w' € each select a same neighbor z. We refer to this 
type of connection as a self-intersection. We view each A* as a subset of Z 
by identifying V with 



The following is a warm-up for the more complicated kind of couplings which 
will be needed later on. 

Lemma 4. Let c> and let t = cn/2. For each k>l, 



weakly in distribution as ?i — )■ oo, where {Sy)v^T denotes a branching random 
walk started from with offspring distribution Poisson(c) and step distribu- 
tion uniform on ( — 1, 1), and 6x denotes the Dirac pointmass at x. 

Proof. The proof of the lemma is an easy extension of Lemma 1, since 
in the case where there are no self-intersections, all the displacements of 
the children of vertices in any given generation form i.i.d. uniform random 
variables on Ml = {—L, . . . , —1, 1, . . . , L}. Details are left to the reader. □ 

In practice, the finite-dimensional distribution convergence result of Lem- 
ma 4 will not be strong enough as we will typically need to explore more 
than a finite number of generations. The following lemma strengthens this to 
show that the breadth-first exploration of a cluster may be coupled exactly 
with a slightly modified branching random walk up until the first time the 
latter has a self-intersection. More precisely, let T be a Galton-Watson tree 
with offspring distribution Poisson(c), and let Sv,v £T, be defined as above 



{-Ln/2J +1,.. .,-1,0,1,..., rn/21}. 
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except that if u G T with offspring vi, . . . ,Vk [let denote the edge {v,Vi)], 
we define the displacement variables X{ei), . . . , X{ek) to be sampled with 
replacement uniformly from Ml- The sampling is still done independently 
for different vertices v £T. We call this process branching random walk with 
replacement for future reference. We also introduce a version with erasure, 
where if v and w are such that = Su, (what we call a self-intersection) 
with V discovered before w in the breadth-first search, then the entire de- 
scendance of w is ignored or killed. We call this process an erased branching 
random walk and denote it by {Sy,v G T). 

Lemma 5. Let {Si,,v € T) denote a branching random walk as above and 
{Sv,v € T) its corresponding erasure. Then there exists a coupling of {Sv)y^T 
and {A^,i > 0) such that the sets and {Sx,,v G Tj} coincide exactly for each 
i >0. In particular, let r be the first self-intersection level; r = inf{n > 1 : 
3v ^ w £ V{Tn),Sv = Sw}. Then we can couple and {Sv,v G Ti) for each 
i <T. 

Proof. For the most part this is a variation on Lemma 1, but there 
are some subtleties. Assume we are exploring the connections of a vertex 
V £ for some i>0. Let A be the 2L — 1 potential neighbors of v, and let 
B C Ahe the set of those within A which have already been exposed so far. 
For each of the \A \ B\ potential new neighbors of v to be added to A'^'^^, 
the edge joining it to v has appeared a Poisson(t/ (nL)) number of times. 
Of course, if an edge appears several times, this amounts to connecting to 
the same vertex, and this is why we choose sampling with replacement. The 
action of sampling uniformly with replacement from A or from A\B can 
be chosen to be identical, until the first time that sampling from A uses an 
element from B. The rest of the details are left to the reader. □ 

Remark 2. Note that by the classical birthday problem, r is unlikely 
to occur before at least of order \/L vertices have been added. Thus we can 
couple exactly the breadth-first search exploration of C^, and a branching 
random walk until of order ^/L vertices have been discovered. 

In fact, this will still not be strong enough and we will need to push 
this exploration until of order o(L) vertices have been discovered. Of course, 
self-intersections can then not be ignored, but there are not enough of them 
that they cause a serious problem, so the breadth-first search exploration 
coincides with "most" of the branching random walk. 

4.2. Survival of killed branching random walk. The basic idea for the 
proof of (ii) in Theorem 1 is a renormalization (sometimes also called "block" ) 
argument. 

We show that if the component of a given vertex is larger than some fixed 
number, then this component is likely to reach distance KL, where K > 
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is a large number (which may even depend on L) to be suitably chosen. 
This may be iterated to show that two points selected at random from V 
will be connected with probability approximately 0{cf', where 9{c) is the 
survival probability of T. For now, we will need a few basic estimates about 
killed branching random walks. In many ways, some of the results are more 
natural to state when we let L — )• oo rather than n — )■ oo. Since L{n) — )■ oo, 
the two statements are identical. 

Consider a branching random walk as above, started at v , with step 
distribution uniform in { — L, . . . , — 1, 1, . . . , L} and some arbitrary offspring 
distribution with probability generating function (j){s). By killed branching 
random walk (KBRW) we refer to a branching random walk where, in ad- 
dition, particles die if they escape a given interval containing the starting 
point. 

Lemma 6. Let 9 denote the survival probability of the branching random 
walk, that is, p = 1 — 6 is the smallest root of z = 0(z) . For each e > we 
can choose K = K{e, (j)) such that if all particles are killed upon escaping 
[v — KL,v + KL], then for all L sufficiently large (depending solely on £ 
and (j)) the survival probability 9^ of KBRW satisfies 9^ > 9{1 — e). 

Proof. Let T denote the Galton-Watson tree describing the descen- 
dants of V. Conditionally on survival of T, the subset U of T for which 
all vertices in U have infinite progeny (i.e., the set of infinite rays) forms 
a Galton-Watson process with modified progeny; the generating function 
satisfies 



Define My := [v — L,v + L]. Consider a subset W of U obtained as follows. 
Let 1^ = {uq,ui, . . . , ur) be a fixed ray in U where R is the first time the ray 
leaves [v — KL, v + KL] . Thus (S^q , , • • • , Suj^) is a random walk with the 
underlying step distribution, killed upon exiting [v — KL, v + KL]. Then W 
restricted to ^ will consist of the subsequence of (ttnj such that Su„- &Afv 
More precisely, we take W to be the union of all such subsequences over all 
rays ^ in U. The vertices of W have a natural tree structure, and we claim 
that W dominates a branching process where the offspring progeny is 



where ek = ^/{K + 2). The reason for this is as follows. Suppose u G W, so 
that Su € A^t,. Then u has (in U) a random number, say A^, of offsprings, 
where the generating function is given by (p in (10). Since the trajectory of 
a random walk with jumps uniform in {—L, . . . , — 1, 1, . . . , L} forms a martin- 
gale and the jumps are bounded by L, a classical application of the optional 



(10) 



4>{s) = lm0s + l-9)-l + 9]. 



(11) 



(I>k{s) = (j){eK + (1 - £k)s), 
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stopping theorem shows that any particular fixed ray emanating from each 
offspring of u returns to My before hitting v it KL with probability at least 
1 — £k- Formula (11) follows easily. Now, survival probability is therefore 
at least as large as the survival probability of the Galton-Watson process 
with offspring distribution given by (px- Let px be the extinction proba- 
bility. Then px = (pKipx) and pK is the unique root of this equation in 
(0, 1), and moreover, px is decreasing as a function of K. Since pK > 0, call 
p = liniK^oo Pk- It is trivial to conclude by continuity of cj) that p = (j){p) and 
that p < 1 , from which it follows that p is the extinction probability of (f) and 
is thus equal to 0. Thus we may choose K sufficiently large that px <£■ D 

We now consider a certain subprocess of the killed branching random walk 
and show that this also survives, growing exponentially and leaving many 
offsprings very near the starting point v. Rather than stating a general result 
we will state only what we need. Fix a function u}{L) such that uj{L) — )• oo 
sufficiently slowly, say u}{L) = logL, and let /o(i) be any function such 
that fo{L) < L/uj{L). Fix an integer d > 1, and explore no more than d 
offsprings for any individual, that is, declare dead any additional offspring. 
Fix A = XK,d and also declare a vertex v dead if the most recent common 
ancestor n of such that Su G Mv is more than A generations away. Refer 
to this process as KBRWi. Note that KBRWi is a subprocess of KBRW 
and thus of BRW . Note also that the erased KBRWi is a subprocess of the 
erased KBRW. 

Lemma 7. Assume that <?5'"(1) < oo so that the offspring distribution has 
finite second moments. For all e > 0, there exists K = K{e, (j)), d > 1 and A 
such that if all particles are also killed upon escaping [v — KL,v + KL], then 
for all sufficiently large L ( depending solely on cj) and e ), with probability at 
least (1 — £)9, the following hold: 

(i) KBRWi gets at least fo{L) descendants in at most clogfo{L) gen- 
erations for some c> 0, 

(ii) fo{L)/K of them are in My. 

Proof. Consider the KBRW of Lemma 6 and let W be as in the proof 
of that lemma. Consider WD KBRWi and note that this is a Galton-Watson 
process with offspring distribution which dominates one with a generating 
function given by (11), where now 1 — e/^ is the probability that a random 
walk with step distribution uniform on { — L, . . . , — 1, 1, . . . , L} returns to Mv 
before exiting [v — KL,v + KL], and that this takes less than A steps. By 
choosing K sufficiently large, d sufficiently large and A sufficiently large (in 
that order), sk is arbitrarily small and thus we find that KBRWi survives 
forever with probability at least (1 — e)9, as in Lemma 6. Note also that WD 
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KBRWi, being a Galton-Watson tree and having finite second moments, 
grows exponentially fast by the Kesten-Stigum theorem. Thus fewer than 
clog/o(L) levels are needed to grow W Ci KBRWi to size fo{L) for some 
c > 0, and so at this level we will certainly have at least fo{L) explored in 
KBRWi. 

Let T be the KBRWi stopped when the population size exceeds fo{L). 
Define the following marking procedure in KBRWi. Mark any node u £ 
KBRWi if the position of the branching random walk Su at this node is in 
the interval Mv Let C 7" be the set of marked nodes. Since by construc- 
tion, every node ugT has an ancestor at (genealogical) distance at most A 
which is a marked node, and since the degree of any node in T is at most 
d+ 1, it follows that 

(12) |A^|>,|r| where, = ^^^^^^i^-^. 

[To see (12), just notice that for every new mark, one can add at most l/rj 
nodes in the tree without adding a new mark, and proceed by induction.] 
For (ii) to occur, it suffices that |A^| > fo{L)/K. Since by construction \T\ > 
fo{L), choosing \ = XK,d> [{^ogK)/{logd)\ — 1 shows that (ii) occurs as 
soon as (i) holds. The proof of the lemma is complete. □ 

We now strengthen this last result by showing that the erased random 
walk also has a large number of offsprings in [v — KL,v + KL]. Further, 
we suppose also that there is a set F of locations which, if an individual 
lands on, results in that individual being removed. We call these forbidden 
locations. 

Lemma 8. Consider an erased branching random walk, started atvGV, 
with step distribution uniform in { — L, . . . , — 1, 1, . . . , L} and some arbitrary 
offspring distribution with probability generating function (p{s) with <^"(1) < 
oo. Suppose also that there is a set F of forbidden locations, with \F\ < 
L/co{L) and uj{L) — )■ oo. Let 9 denote the survival probability of the branch- 
ing random walk. For all e > we can choose K = K{e, (j)) such that if 
all particles are also killed upon escaping [v — KL,v + KL], then for all 
sufficiently large L (depending solely on (j) and e), with probability at least 
(1 — e)9, the following hold: 

(i) the erased KBRWi gets at least fo^L) descendants in at most 
clogfo{L) generations for some c> 0, 

(ii) fo{L)/K of them are in My. 

Proof. Let r be the first time that the killed branching random walk 
has more than 2/o(L) descendants. Let us show that the associated erased 
branching random walk has at least fo{L) individuals at that point with high 
probability. To see this, we first observe that by (i) in Lemma 7 the number 
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of generations, r, is at most clog/o(L) for some c > 0. Before time r, for 
each new vertex added to the branching random walk, the probabihty that 
it is a self-intersection or hits an element of F is no more than (2/o(L) + 
\F\)/{2L — 1). Thus the probability that a particular ray of no more than 
clog/o(L) generations contains a self-intersection is, by Markov's inequality, 
at most 

c(log/o(L))(2/o(L) + |F|) 
2L-1 

as L — 7- oo. Therefore, the number of vertices that are present in the KBRWi 
but not in the erased KBRWi is, by Markov's inequality again, at most 
(1/2) X (2/o(L)) = /o(L) with high probability. We shah denote by EKBRWi 
the erased KBRWi which has a set F of forbidden locations. 

By Lemma 7, we also know that 2fQ{L)/K individuals of the KBRWi 
population are located in Mv- Since we have just shown that the total number 
of individuals not in EKBRWi is o{fo{L)) with high probability, we deduce 
that at least fQ{L)/K individuals of EKBRWi sue located in M^- The proof 
of the lemma is complete. □ 

4.3. Breadth-first search explorations. The next three lemmas give us 
some information on the breadth-first search exploration of a component C„ 
of a given vertex v gV in the random graph G{t). For reasons that will 
soon become clear, we wish to assume that by the point we start exploring 
the component C^, part of the graph has already been explored (a vertex 
has been explored once all its neighbors have been observed). The part that 
has already been explored (denoted F) represents forbidden vertices, in the 
sense that since we have already searched F, the breadth-first search of Cy 
can no longer connect to it. 

We now specialize to the case where (j) is the generating function of 
a Poisson(c) distribution, and in all that follows we let 9 = 0{c) be the 
survival probability of a Poisson(c) Galton-Watson tree. It turns out that 
we need to separately treat the case where L is very close to n, and this will 
be done later in Lemma 15. 

Lemma 9. Fix c> 1, e > 0, v gV and fix K = K{e,c) as in Lemma 8. 
We assume that a set F containing at most L/uj[L) vertices have already 
been discovered in [v — KL,v + KL] (and v is not one of them). Then for 
all n large enough (depending only on e and c), if L <n/{2K), then with 
probability at least 9{1 — e), a search procedure of Cy can uncover at least 
f(){L)/K vertices of Cy in My without exploring more than fo{L) vertices in 
total in [v — KL,v + KL], and none outside. 

Proof. Consider the breadth-first search exploration of Cy, with the 
following modifications. We stop exploring the descendants of any vertex 
outside of [v — KL,v + KL]. We also completely stop the exploration when 
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more than fo{L) vertices have been discovered. Also, we stop exploring the 
descendent of any vertex in F and we fix d > 1 and truncate the offspring 
progeny at d, so that if an individual has more than d offspring, only the 
first d encountered are explored. This keeps the degree of any node in the 
genealogical tree bounded by d + 1. We choose d = dx as in Lemma 7. We 
also stop exploring the descendants of an individual if the time elapsed since 
the last time an ancestor of this individual visited Mi, exceeds A = XK,d, 
where XK,d is as in Lemma 7. We refer to this process as KBFSi. More 
formally, we use the following algorithm: 

Step 1. Set VLe = 0, = {v}- These correspond to the explored and 
active vertices, respectively. 

Step 2. If |r2£;| > /o(-Z>) we stop. Otherwise we proceed to Step 3. 

Step 3. Set il^v = 0- For each w G VLa, add its neighbors (excluding the 
parent of w) to until d have been added, or there are no more. 

Step 4. Add the vertices in VLa to Qe- 

Step 5. Set ^a = ^n\{^e^ F}- If ^A = 0, then we stop. 

Step 6. Remove from VLa all vertices outside of [v — KL, v + KL] and those 
that do not have an ancestor in Mv fewer than A generations away. 

Step 7. Go to Step 2. 

This exploration can be exactly coupled with the EKBRWi considered 
up to the first time r that the total population size exceeds fo{L), by taking 
in Lemma 8 the set F as it is defined here. Lemma 9 thus follows directly 
from Lemma 8. □ 

To establish the existence of a connection between two vertices v and w, 
it will be useful to add another twist to the breadth-first search exploration 
of Cy and C^, by reserving some of the vertices we discover along the way. 
That is, we decide not to reveal their neighbors until a later stage, if nec- 
essary. This allows us to keep a reserve of "fresh" vertices to explore at 
different locations and that we know are already part of Cy or C^. To be 
more precise, let e > 0. Let 1 < c' < c be such that 0{c') > 6{c){l — e/2). Let z/ 
be small enough that c{l — v) > c' . When exploring Cy through a method 
derived from breadth-first search, we choose which vertices to reserve as fol- 
lows: for each new vertex that we explore, if it has any offsprings, we choose 
one uniformly at random, and reserve it with probability independently 
of anything else. (See below for a rigorous formulation.) Note, in particular, 
that the set of vertices that get reserved is dominated by a Poisson thin- 
ning of the original exploration procedure, with thinning probability i'. Let 
K = K{e/2,c') be as in Lemma 8. Note that with this choice of v and K, 
the survival probability 0' of EKBRWi is at least 

(13) e\c) > e{c'){l - e/2) > e{c){l - e) 

for all L sufficiently large (depending solely on c and e). 
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Thus, starting from a vertex v, a branching random walk killed when 
escaping [v — KL,v + KL] with this reservation procedure survives forever 
with probability at least 9{c){l — e). From this we deduce without too much 
trouble the following result. 

Lemma 10. Fix c > 1, e > 0, v & V . Let v = v{e) as above, and assume 
that a set F containing no more than L/io{L) vertices have been discovered. 
Then for all sufficiently large n (depending solely on e and c), if L <n/ {2K), 
the following hold with probability at least (1 — 2e)9: 

(i) A search procedure can uncover at least fi{L) = fQ{L)/K vertices 
in Mv without uncovering more than 2/o(L) vertices in total in [v — KL, 
V + KL], and none outside. 

(ii) At least 5fo{L) vertices are reserved inNv, for 6 = {1 — e~'^)v{£) / [AK) . 

Proof. We apply the above reservation method to KBFSi (see the 
proof of Lemma 9). Formally, we introduce a set of reserved vertices 
(initially = ) . We use the same algorithm as for the modified breadth- 
first search but now Step 7 becomes: 

Step 7'. Partition $7^ into classes of vertices with the same parent in the 
exploration. Choose uniformly from each class a representative and with 
probability v this representative is added to VLr and removed from Q.^- Go 
to Step 2. 

We call this new search procedure KBFS2. Let r be the time we have 
discovered fo{L) nonreserved vertices. At this time the total number of ex- 
plored vertices is less than 2/o(L) and thus, similar to the proof of Lemma 9, 
we can couple the exploration with an erased KBRWi where the offspring 
distribution has a slightly modified offspring distribution (a randomly cho- 
sen offspring is removed with probability v). We call this an erased KBRW2- 
Reasoning as in Lemma 9, and using (13), we see that (i) holds with proba- 
bility at least 0(1 — e), provided that n is large enough and L < n/{2K). For 
each new vertex exposed by KBFS2 in Nv, it has a reserved offspring in Nv 
with probability at least (1 — e~^)v/2, as if u S Nv and X are uniformly 
distributed on {—L, ...,—1,1,..., L}, then u + X & Mv with probability at 
least 1/2. Thus (ii) follows from (i) and from Chebyshev's inequality. □ 

With this lemma we are now able to show that a vertex v connects 
to V ziz KL with probability essentially 9, and that many vertices in the 
same component may be found without revealing too much inside [v — KL, 
v + KL]. 

Lemma 11. Fix c > 1, e > 0, and let K be as in Lemma 9. Let < 
C < ^' < 1/2 and let v £V. Assume that a set F of no more than L/lo{L) 
vertices have already been explored in \v — KL, v + KL] and v is not one 
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of them. Let Bk,v denote the event that v is connected to at least unex- 
plored vertices in the range [v + KL, v + {K + 1)L] which may be discovered 
by searching no more than vertices. Then for all sufficiently large n 
(depending solely on e, c, Q and C,' ), if L < n/(2(Ko(e) + 1)), then 

Pfc.)>e(c)(l-e). 

Proof. Consider the KBFS2 exploration of C^, stopped when a total of 
/o(i) = /2 vertices of have been exposed (additional to those exposed 
initially). By Lemma 10, with probability at least 0{c){l — e) if n is large 
enough and L < n/{2K), this search reveals at least k = 6fo{L) reserved 
vertices within Mv, and no more than vertices in the range [v — KL, 
v+KL] have been explored (let Av denote this event). On Av, label vi, . . . ,Vk 
the first k such vertices to have been discovered in M^- After this stage, we 
then continue the KBFS2 exploration started from only, until 

a total of L^ /2 further vertices are exposed. Note that the exploration can 
be coupled with a system of k erased KBRW2, started from vi,. . . ,Vk. The 
total number of vertices searched by the end of the second stage will be 
no more than /o(L) + L<' /2 < L^/^. Thus, as in Lemma 8, the probabil- 
ity each particular vertex gives rise to a self-intersection is no more than 
{\F\ + VL)/{2L - 1) ^ as n ^ 00. 

Moreover, using domination by a branching process (Lemma 1), it is easy 
to see that the number of generations for the /2 vertices to be discov- 
ered by the branching random walk is at least 61ogL for some b> 0, with 
probability tending to 1 as n — t- 00. Now, for every 1 <i <k, the probability 
that the erased branching random walk (erased KBRW2) started from Vi 
has a descendant that hits [v + KL, v + [K + 1)L] in fewer than 6 log L steps 
is at least 0{c){l — o(l)) > 9{c)/2 for n large enough (depending solely on e 
and c). Thus we deduce that, on the event Av, the number of particles that 
hit [v + KL, V + (K + 1)L] stochastically dominates a binomial random vari- 
able Bin((5/o(L),^(c)/2). By applying Chebyshev's inequality, this is with 
high probability greater than 6fo{L)6{c)/4 > L^ for all n sufficiently large 
(depending on c, e, Q, ('). When this occurs, Bk,v holds, so the result follows. 
□ 

Let c > 1, e > and fix = Kq{c,£) as in Lemma 11. We now prove 
that if the connected component of a given vertex is not finite then it must 
spread more or less uniformly over V. As desired, this is achieved by keeping 
a density of explored sites small, lower than l/oj{L). Let v gV and split the 
vertex set V into r + 1 = \n/{KL)~\ disjoint strips {Io,...,Ir) of size KL, 
except for the last one which may be of size smaller than KL. Let Jj denote 
the initial segment of /j of length L. Since L{n) > (logn)^"*^^ for some pos- 
itive ^ by assumption, we may find < 1/2 such that L{n) > (|logn)^/^. 
Let C' = (C + l/2)/2, hence C < C < 1/2. 
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Lemma 12. With the above notation, assume that no more than Lp'^' 
vertices have already been exposed in each strip lQ,...,Ir. Let Ck,v denote 
the event that v is connected to at least k = L'^ vertices in strips L^,..., Ir-i, 
which may be discovered without exposing more than an additional ver- 
tices in each strip, and that in each Ji,3 < i < r — 1, at least k/2 vertices 
connected to v are unexplored at the end of the search procedure. Then 

nCK,v)>e{c){l-e) 

for all n large enough (depending solely on e and c), and provided L < 
n/{2K). 

Proof. This is basically proved by iterating Lemma IL We can assume 
that V is in strip Ii. In the first step we explore Cy using KBFS2 killing at 
the boundary of /q U /i U /2- The arguments of Lemma 11 still carry through 
to obtain that B^j^.^ holds with probability at least 9{c){l — e) where 5^^, 
is the event that v is connected to at least k unexplored vertices in J3, and 
fewer than vertices are explored in finding them. 

Then define inductively for 1 < i < r — 4, B'^j^ ^ = B^j^l H C]^ ^, where C]^ ^ 

is defined as follows. On B^^^l, let vi,...,Vk be a list of k vertices in the 
range 7^+2 that are the first to be discovered in this search procedure in 
this range. Then C)^^, is the event that we can find at least k connections 

between vi, . . . ,Vk and without exploring more than an additional L^^' 
new vertices in Ij_|_2 U Ij+s . 

This is where it starts to pay off to allow for the exploration to unfold 
in a partially revealed environment in Lemma 11. Indeed, let i > 1 and 
condition on B}^^. We reserve (i.e., do not explore further) ffc/2+1, • ■ • ji'fc- 
We explore successively the components Cy-^ , . . . , Cy^^^ , each time perform- 
ing KBFS2 of Lemma 11. Since we never reveal more than L^' vertices at 
each of those k/2 steps, and since we did not reveal more than {k/2)L^' = 
/,C+C I2 < L^^ /2 other vertices in IiJ^2 previously (since C*~^ holds), we see 
that the search may be coupled with high probability (depending solely on c 
and e) to {k/2) erased KBRW2 started at ^^i, . . . ,ffc/2- Thus the total num- 
ber of connections between vi, . . . ,vi^j2, to Jj+3 is dominated from below 
by feBin(/c/2, 0(c)/2). Indeed, for each of {k/2) trials there is a probability 
9{c){l — £)> 6{c)/2 of success (by Lemma 11), in which case k connections 
are added. Thus, using standard Chernoff bounds on binomial random vari- 
ables, 

¥{{B),J\B^^l)=¥{{Ci,^,f\B^^l) 

<P(/c-Binomial(A;/2,6l(c)/2) <k) 
<exp(-^(c)A;/4). 
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It follows by easy induction that for all n large enough, letting = 

nc'K,.)>{^-e-'"ynB%). 

Since L > ((4/61) logn)i/C, r = \n/{2LK)] and k = L?, it follows that (1 - 
e-ok/y ^ exp{-re-^^/^) > exp(-r/?i) 1. Vertices can only be discovered 
during two consecutive steps of the proof, and hence, the total number of 
vertices discovered in each strip is no more than L^^ . Thus Ck,v ^ C'k v ■ "^^^ 
proof of the lemma is complete. □ 

Lemma 13. Let P„ = {|C„| > logL}. Then for any fixed v £V, 
and for v, w fixed in V, 

Proof. This is a direct consequence of Lemma 5 and the remark fol- 
lowing it. □ 

Lemma 14. Fix c> 1, e > 0. Let v,w be chosen uniformly at random 
in V , and let £ = {C„ = C^} he the event that they are connected. If L < 
n/(5K), and n is large enough (depending solely on e and c) then 

(14) ¥{£^\VynV^)<e. 

Proof. We fix K{c,£) as in Lemma 12. We apply this lemma a first 
time by exploring as specified in this lemma with a set of forbidden 
vertices (vertices previously explored) being empty. We then let F be the 
set of all vertices explored during that procedure. 

We apply one more time Lemma 12 by exploring using a set of for- 
bidden vertices given by F (which must necessarily satisfy the assumptions 
of Lemma 12, since the search of Cy did not reveal more than L^^' vertices 
in each strip). Note that conditionally given Vy CiVyj, both Ck,v and Ck,w 
must hold with high probability (depending solely on c) . Let us show that £ 
must then hold with high probability. 

Since Ck,v and Ck,w hold, we know that each interval Ji,3 <i <r — 1, 
contains at least L^/2 unexplored vertices from both Cy and C^. We now 
apply Lemma 10 repeatedly, starting from each of these unexplored ver- 
tices. Since L < n/5K, we have that r > 4. While fewer than 6fo{L) vertices 
have been reserved in Jj, we know that fewer than fo{L) vertices have in 
total been explored and thus Lemma 10 can still be applied. We deduce 
that (conditionally given T>y D'Dy^) in each Ji,3 < i < r — 1, with probability 
greater than 1 — o(l) depending solely on e and c, there are 5L/uj{L) re- 
served vertices from Cy and 5L/oj{L) reserved vertices from Cy,, with 6 as 
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in Lemma 10. Thus at least one Ji (say Ji) contains 5L/uj{L) unexplored 
vertices from both and Cw The probability to not observe a connection 
between these {SL/uj{L))'^ pairs of vertices inside Ji is at most (by revealing 
only the status of the edges connecting each such pair) 



for all n sufficiently large. Thus C^, = Cyj with high probability (depending 
solely on e and c) given P^, nP^, and hence, £ holds with high probability 
(depending solely on e and c) given n Vyj . □ 

We deal with the case L > n/{5K) separately. 

Lemma 15. Fix c > 1, e > 0. Let v,w be chosen uniformly at random 
in V, and let E = {Cy = C^} be the event that they are connected. If L> 
n/{5K), and n is large enough (depending solely on e and c) then 



Proof. We let £ denote the event that we can explore Cy until at 
most L^'""^ vertices have been exposed finding at least L^'^ reserved ver- 
tices and also can explore Cw until at most L^''^^ vertices have been ex- 
posed finding at least k = L^'"^^ reserved vertices. By a simple modification 
of Lemma 10, the probability of this event given DyODw is at least 1 — o(l). 

Let us show that, given the above event £, Cy and Cyj intersect with 
high probability. We partition {1, . . . ,n} into s = 5K + 1 disjoint intervals 
of size less than or equal to L. On the above event, by the pigeonhole prin- 
ciple there must be at least one region of size at most L, denoted /, with 
more than L^'^/s reserved vertices from Cy. We denote by wi, . . . ,Wk the k 
reserved vertices from Cy,. For each 1 <i < k, we continue to explore Cy,i 
by breadth-first search for 6(s -|- 1)^ generations, or until a descendent is 
observed in interval /. Since s depends only on e and c, with probability at 
least 1 — o(l) depending on e and c, no self-intersections occur throughout 
this evolution. We claim that the probability the evolution of Cy,. results in 
us finding a descendent in / is at least 6{c)/3 for n large enough depend- 
ing solely on e and c. Indeed this occurs if we can find a ray emanating 
from Wi where the corresponding random walk goes around the circle in 
less than 6(s -|- 1)^ levels. We let {Xj)j>i denote the location of a random 
walk on Z which starts at and where the jump distribution is uniform on 
{— L, . . . , —1, 1, . . . , —L}. It is clear that (^j)j>o is a martingale, as is 





(15) 



F{£^;VynV^)<e. 



(X|-iL(L + l)(2L + l)/(6L)) 



i>o- 
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Letting T denote the time the walk goes above sL, or below —sL, we see 
that by optional stopping E(T) < 3{s + 1)^. Thus, by Markov's inequality, 
P(T > 6{s + 1)^) < 1/2. Hence, a random walk on V with jumps uniformly 
distributed on {—L, . . . , —1, 1, . . . , —L} goes around the circle in less than 
6(s + 1)^ steps with probability at least 1/2. It follows that the desired ray 
exists with probability at least (1 — o(l))0(c)/2, depending solely on e and c. 
Thus, given £, we can find L^'"^^ reserved vertices from and L^'^ reserved 
vertices from C„ in the interval /, with probability 1 — o(l) depending solely 
on e and c. Looking one level further, the number of connections between Cy 
and Cw in this region is Bin(L^''^^, c/(2L)) which is larger than 1 with 
probability 1 — o(l) depending solely on e and c. Equation (15) now follows. 
□ 

We are now ready to finish the proof of (ii) in Theorem 1 . 

Proof of Theorem l(ii). Let v,w be chosen uniformly on V. Let 
£ = {Cy = Cyj} be the event that they are connected. 

Consider W = {v €V -.Vy holds}. We already know that E(|VF|/n) 9 
and E{\W\'^/n'^) 9"^, so that 




n 



in probability. Furthermore, observe that if v,w are uniformly chosen in W, 
then Cy = Cw with high probability depending solely on c by Lemmas 14 
and 15. Also, if u G W , then clearly Cy C W . Hence, W consists of a union 
of clusters. Let X„ denote the size of a cluster from W chosen according to 
size-biased picking, that is, Xn =d |C^|/|Vl^|, where v is chosen uniformly at 
random in W. It is a well-known consequence of exchangeability (and easy 
to see) that 

F{Cy = Cy,\v,wGW) = EiXr,). 

By (14), if L < n/{4LK) [resp., by (15) if L > n/{4LK)], we have that 

F{Cy = C^\v,weW)^l, 

hence, E(X„) — )■ 1. Since Xn < 1, it follows that Xn — )■ 1 in probability. This 
implies that, for all e > 0, with high probability depending solely on e and c, 
W contains a component of size at least ^^(1 — e) >^(c)(l — 2e). This proves 
the existence of a giant component of mass relative to V equal to 6 in the 
limit n — 7- oo. 

Let us show that all other components are small. Note that by the above, 
we already know that the second largest component size, is such that 
L^^/\W\ — )• in probability. Hence, Ly^/n — )• as well. Let be the 

largest component size in W^. By definition, is smaller in size than 
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logL. Since < max(L^Q,L^), we conclude that 



n 



in probability, as desired. The proof of (ii) in Theorem 1 is complete. □ 
We now conclude with the proof of (iii) in Theorem 1. 



Proof of Theorem l(iii). Since L = o(logn), we have L< ||logn, 



with a > as in the statement of (iii). We begin by dividing 1, 

, 1— a 1 



,n mto 

n"""'logn disjoint intervals of size n"/logn, labeled Ai, . . . , A^i-ai^gn- 
each interval we show that we can find an interval of size L, none of whose 
vertices have been involved in a transposition by time t with high probability. 
We show in fact, that all the n^~"logn intervals contain such a sub-interval 
with high probability. Thus the largest component must be of size smaller 
than 2n"/log?i, and hence, in particular, there will be no giant components. 

For a given interval of size L, the number of potential edges connected to 
vertices in this interval is 2L^ — (2) = (3L^ + L)/2. Each of these edges is 
present with probability c/(2L). We call the interval empty if none of the 
edges are present. The probability a given interval of size L is empty is 

(l-c/(2L))(^^^+^)/^~exp(-|(3L + l)). 

We divide each Ai into [n"/(Llogn)J intervals of size L, denoted 

Al a2 .In'^/iLlogn)] 
/Ij , /Ij , . . . , /i^ 

Let Sf' = {Af^ is empty}. We consider the set of events 
{5f,l<fc<iK/(^logn)J}, 

which are independent since each interval is at distance at least L from any 
other. For each i, we let 

K/{ilogn)J/2 



E 

k=l 



We have 



and so 



\Bi > 0) ~ 1 - exp 



> for all 1 < i < n^'" log 



1 




g-c(3L+l)/4 


~2 


L log n 





n 



' exp 
> 1 



-n^ " log n exp I 







(4 


Llogn 



-c(3L+l)/4 



since L < 



as n —7- 00 

logn. The proof of Theorem 1 is complete. 



□ 



EFFECT OF SCALE 



29 



5. Proof of Theorem 2. We will prove a stronger result than Theorem 2 
by allowing the distribution of edge-lengths to be more general. Recall the 
definitions of {pe) and given at the beginning of Section 3. Let (rj)j>i 
be a sequence of i.i.d. transpositions with ri = (i j) where i,j are chosen 
uniformly from {ti, G y : ||n — = L}- Then we construct the permutation 
ai = Ti o ■ ■ ■ o TTVt, where {Nt^t > 0) is an independent Poisson process. In 
words, at rate 1, we transpose two markers at random with distance D, 
where D is chosen according to the distribution {pt). We recover the process 
(ft > 0) when {p() is the uniform distribution on {!,... ,L(n)}. 

Theorem 6. Assume e„ — )• as n ^ oo. Then we have the following 
convergence in probability as n — )• oo; for all c> 0, 

n ' 

There is a natural coupling between the process {(Jt,t > 0) and the random 
graph {G{t),t > 0) defined in Section 3. The coupling is an adaptation of 
the coupling with the Erdos-Renyi random graph in Berestycki and Durrett 
(2006). Consider the following procedure. Initially, Gq consists of isolated 
vertices. Suppose that at time t, a transposition r = {i,j) is performed. 
If G{t) already contains the edge we do nothing, else we add it to the 
graph. 

The relationship between at and G{t) is not one-to-one; however, the fol- 
lowing deterministic observation holds as in Berestycki and Durrett (2006). 
For every t > 0, every cycle of at is a subset of a certain connected compo- 
nent of G{t). That is, the partition of V obtained from considering the cycle 
decomposition of at is a refinement of the partition obtained from consid- 
ering the connected components of G{t). This is easily proved by induction 
on the number Nt of transpositions up to time t, after observing that the 
cycle decomposition of at undergoes a coagulation-fragmentation process. 
Indeed, every transposition that involves two particles from the same 
cycle yields a fragmentation of that cycle, while if the two particles are in 
distinct cycles they merge. 

This coupling is the basis of our proof. Armed with Lemmas 2 and 3, in 
order to prove Theorem 6 we need to show that Kt and \at\ differ by o(n) 
(Lemma 16), where we recall that \a\ is the number of cycles of the permu- 
tation a. 

Lemma 16. Assume e„ — t- 0. Let t = cn/2, where c> 0. As n — )• 00, 



n 

in probability. 

Proof. This argument is somewhat analogous to the proof of Lemma 6 
in Berestycki (2011). First we note that by the properties of the coupling 
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between at and G{t), it is with probability 1 the case that Kt < \at\. To prove 
a bound in the converse direction, we need to distinguish between small and 
large cycles or components. We say that a cycle of at or a component of G{t) 
is small if it has a size less than and large if it has size at least 

Note that the number of large cycles and the number of large components 
is at most ny/e^ = o(n). It thus suffices to control the difference between the 
number of small cycles and the number of small components. However, note 
that at any time, the probability of generating a small cycle by fragmentation 
is at most 4(l/y^^)e„. To see where this comes from, suppose the current 
permutation is at = a, and the first position for the transposition to 
be performed has been chosen. Thus j will be one of the n — 1 other vertices 
chosen according to the distribution (pi). Then to produce a cycle of size 
exactly k, j must be equal to a^~^{i) or a~^~^^{i). (Depending on the exact 
size of the cycle containing i, there may be two other points allowed.) Thus, 
conditioning on the point i, the probability of creating a fragment of size 
smaller than is at most 4:{l/y/e^)en, as claimed. It follows that since 

each excess small cycle must have been generated by such a fragmentation 
at some time s <t, and since transpositions occur at rate 1, 



as n — )• oo. The proof is complete. □ 

Proof of Theorem 6. The proof of Theorem 6 now follows directly from 
Lemmas 2, 3 and 16. Indeed, S{at) =n — \at\. By Lemma 16, n-HWt\-Kt) 
tends to in probability. We have concentration of Kt around its mean 
by Lemma 3, and the mean is obtained in Lemma 2. Putting these pieces 
together we obtain Theorem 6. □ 

6. Proof of Theorem 3. We consider the case where L is bounded (say 
by some constant C) and show that if t = cn/2 with c > 0, then 5{t)/n is 
bounded away from c/2, where we write 5{t) = 5{at)- 

Lemma 17. Assume L is bounded. Fix c > and let t = cn/2. Then 
there exists ri = r]c> such that 6{t) < (1 — ri)cn/2 with high probability. 



n\at\-Kt]<n^,^ + 4t^/I^. 
Thus by Markov's inequality, taking t = cn/2, for all 6 > 0, 





In the statement above and in what follows, the expression with high 
probability means with probability tending to 1 as n— )• oo. 
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Proof of Lemma 17. Since each transposition decreases the number 
of cycles by 1 if there is a coagulation and increases it by 1 if there is 
a fragmentation, we have 

(16) 6{t) = Nt-2Ft, 

where Ft is the total number of fragmentations by time t. It suffices to show 
that Ft > rjn when t = cn/2, for some 77 > 0. Let £ TZl- Consider the 
event Aij that the transposition occurred twice by time t, and that 

no other transposition involved either i or j by time t. There are 4L — 2 
possible transpositions involving i or j but not both, with each occurring at 
rate l/{nL). Thus the number of such transpositions that occur by this time 
is Poi(t(4L — 2)/nL) which has a positive probability, Qc, of being 0. Further, 
the number of times transposition occurs by time t is Poi(t/nL) and 
thus we have a positive probability, Pc, of it occurring exactly twice. Thus 
= QcPc > for each £ TZl- 
Moreover, the events {A2iL,2iL+i)o<i<ln/2L\-i are independent and each 
occurs with probability qcPc- Note that the number Ft of fragmentations 
satisfies 

[n./2LJ-l 
i=0 

It thus follows from Chebyshev's inequality that P(-Ft > nqcPc/ {^C)) — t- 1, 
where C is an upper-bound on L. Hence, Ft > rjcU with rjc = qcPc/{^C)- 
Plugging back in (16) completes the proof. □ 

We now turn toward the proof of Theorem 3. Assume without loss of 
generality that L{ri) = Lis constant. As Nt =d Poisson(t), we obtain directly 
from Chebyshev's inequality that c/2 in probability. 

It thus suffices to show that also has a limit as n — )■ 00. Let 

{^t)t>o be the filtration associated with the entire history of the process; 
that is, Tt = cr{Ti,i < Nt). For s > 0, let gn{s) denote the Js-measurable 
random variable giving the instantaneous rate of fragmentation given ag- 
Let 

At= gn{s)ds 
Jo 

and observe that if Mt = Ft — At, then {Mt,t > 0) is a martingale with 
respect to the filtration {Tt)t>o, for each n > 1. 

We prove convergence of n~^Ft (with t = cn/2) in two steps: 

(i) n~^At converges, 

(ii) n~^Mt —7- in probability, which will follow from Doob's inequality. 
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Note first that by a change of variable, 

1 1 r 

(17) -^cn/2 = o / 9n{sn/2)ds. 

n ^ Jo 

Lemma 18. There exists a nonrandom function g{s) such that 

^\!^9{s)ds. 

Proof. Since gn{s) < 1 almost surely, it suffices to show (by Fubini's 
theorem and Lebesgue's dominated convergence theorem) that E(5„(s?i/2)) — t- 
g{s) for all fixed s > 0. Let Cg be the cycle of ag containing the origin. By 
exchangeability, note that 

E(5„(sn/2))=P(^'GC'W2)' 
where v is chosen uniformly among the 2L — 1 neighbors of 0. Fix such 
a neighbor v. The idea for the proof of this lemma is that the cycle struc- 
ture of V can be coupled with the cycle structure of the origin in a random 
transposition process on the infinite line Z, rather than on the torus. More 
precisely, let Goo be the graph where the vertex set is Voo = Z and the edge 
set is Eoo = {{i,j) £ ^ x Z, |z — j| < L}. Consider the process {crf°,t > 0), 
with values in the permutation of Voo, obtained by transposing each edge 
{i,j) G -Eoo at rate 1/(2L). It is not obvious that this process is well defined 
as there are an infinite number of edges. However, the process may be con- 
structed using a standard graphical construction [see, e.g., Liggett (1985)]. 
Briefly speaking, for every (nonoriented) edge e G E^o, consider an indepen- 
dent Poisson process which rings at rate 1/(2L). Then the value a'j^{w) 
is defined for every t > and w S Voo by following the trajectory between 
times and t of a particle which is initially on w and moves to a neighbor j 
of its current position i each time the edge e = (z, j) rings. It is easy to 
see (and will be shown below) that almost surely there are empty patches 
(where no edge has rung) surrounding the origin. Thus the trajectory cannot 
accumulate an infinite number of jumps in a compact interval, and hence, 
is well defined. Moreover, the cycle of the origin in contains only 
finitely many points almost surely for s > 0, since it must be contained in 
between two empty patches. 

Let c > 0. We claim that there is an event Q = such that P(^) — )• 1 
as n — )• oo and such that on Q, Csn/2 are identical. (Here we use 

the obvious identification V = Z/nZ as a subset of Z, as V = { — [n/2\ + 
1,...,-1,0,1,..., [n/2j}.) We choose g{s) =F{v e C^). 

The event G we choose is 

Gn = {Cu C [— logn, logn] for all u < sn/2}. 

The coupling between Cgn/2 'iiid is obvious on Gn since we can use the 
same graphical construction for both at and af° . It remains to show that 
¥{G) — 7- 1. To do this it suffices that there is a strip of size at least L in 
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[— logn,0] and in [O,logn] where each vertex in the strip has never been 
involved in a transposition by time sn/2 (we say that such a vertex has 
degree 0), what we called earlier an empty patch. A given interval of size L 
contains exactly L{2L — 1) — (2) = (3L^ — L)/2 distinct edges, hence, the 
probability that it is an empty patch is 

exp(^--^^J =:p(^)>0. 

If some patches of size L share no edge in common, then the events that 
they are empty are mutually independent. Since we can find at least alogn 
distinct patches that do not share any edge in [O,logn], for some a > 
depending only on L, the probability that there is no empty patch in [0, logn] 
is at most (1 — p(s))"'°s" — 0. Hence, P(^n) — ^ 1 and Lemma 18 is proved. 
□ 

Lemma 19. Var(iAcrt/2) — > as n — ;> 00. 

Proof. Using (17) and Cauchy-Schwarz's inequality, 

Var^^Ac„/2^ < \ar{gn{sn/2))ds. 

Since gn{s) < 1, it suffices to show that Var((7„(sn/2)) — t- for all fixed s > 0. 
Now, note that 

gnisn/2) = - V/, 

where 



n 

vev 



- 2L - 1 ^ ^{ 



«jeC„(sn/2)}' 

^w—v\\<L 

and where Cy{s) denotes the cycle containing v in ag- Let 

Av = {Cv{r) C [v — logn, V + logn] for all r < sn/2}, 

where the addition and substr action is done modulo n. If ||f — > 2 logn, 
then on Av n A^' the random variables fy and /„/ may be taken to be 
independent. Reasoning as in Lemma 3 shows that Var(^ — )■ 0, since 
by Lemma 18 we know that ¥{Av H Ay') — )• 1. □ 

Our final step is to show that converges in probability to 0. 

Lemma 20. For all e > 0, 



sup I Ms I > en ) — )• 0. 

s<cn/2 
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Proof. By Markov's inequality, 

p(sup|M,/n| >e)= p(sup|M,/np > e^) 

^ E(sup,<jM,/np) 

£2 

4E(M2/n2) 

by Doob's inequality. Now note that since Mt is a martingale whose jumps 
are only of size 1, 

^1- [ 9n(.s)ds 

Jo 

is again an (J^t)t>o-martingale. [To see this, observe that F^-i is Poi(i) and 
hence, -^^-i — t is a martingale.] Thus K{M^ ) < t for all t>0 and when 
t = cn/2, 

p( sup \Ms\ >en)<^-^0 

as claimed. □ 

Proof of Theorem 3. It follows from Lemmas 18 and 19 that 

1 lo s(^) ™ probability, where g{s) = F{v G C^) has been defined in Lem- 
ma 18. By Lemma 20, we deduce that 

1 If 

/ 9{s)ds 
^ Jo 

in probability. Since 5{t) = Nt — 2Ft for all t > 0, it follows that 

1 c 

-6{cn/2) -^v{c) = - - g{s)ds. 

n 2 Jo 

By Lemma 17, we must have v{c) < c/2 for all c > 0. It thus suffices to show 
that g is continuously differentiable on [0, oo). Assume that the process (o"^) 
is in some state such that the (finite) cycle C containing also contains v. 
Let /i(C) denote the instantaneous rate at which v becomes part of a dif- 
ferent cycle; note that this rate depends indeed only on C and not on the 
rest of uf^, and satisfies /i(C) < \C\'^/(2L). Likewise, assume that the cycle 
containing v, C , is distinct from C. Let /2(C, C") be the instantaneous rate 
at which these cycles merge. Then /2(C,C") < \C\ x \C'\/{2L). 

Note that \C^\ > k implies that there are [k/L\ consecutive intervals of 
size L around all containing at least one edge in the associated percolation 
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process. By considering every other interval, this imphes that we can find 
\k/{2L)\ disjoint intervals of size L, all of which contain at least one edge. 
Such events are independent, and hence, if Poo(c) > is the probability 
that at time c an interval of size L is an empty patch, we find (summing 
over at most k possible locations for the leftmost point of this sequence of 
consecutive intervals), 

P(|C'ri>A:)<A;(l-poo(c))L'=/'^J, 

so that \C^\ has exponential tails. It follows directly that EdC^p) < oo, 
and if C'^{v) denotes the cycle containing v at time c, £(1(7^11(7^(^)1) < oo 
by Cauchy-Schwarz's inequality. A routine argument thus shows that 

By the same arguments, we see that g'{c) is continuous, which in turn shows 
that V is continuously twice differentiable. The proof of Theorem 3 is com- 
plete. □ 
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